-
People learning American Sign Language (ASL) and practicing their comprehension skills often encounter complex ASL videos that contain unfamiliar signs. Existing dictionary tools require users to isolate a single unknown sign before initiating a search by selecting linguistic properties or performing the sign in front of a webcam. This process is challenging: learners must extract and reproduce unfamiliar signs, the search disrupts the video-watching experience, and it forces reliance on external dictionaries. We explore a technology that allows users to select and view dictionary results for one or more unfamiliar signs while watching a video. We interviewed 14 ASL learners to understand their challenges in understanding ASL videos, their strategies for dealing with unfamiliar vocabulary, and their expectations for an in situ dictionary system. We then conducted an in-depth analysis with eight learners to examine their interactions with a Wizard-of-Oz prototype during a video comprehension task. Finally, we conducted a comparative study with six additional ASL learners to evaluate the speed, accuracy, and workload benefits of an embedded dictionary-search feature within a video player. Our tool outperformed a baseline, an existing online dictionary, across all three metrics. The integration of a search tool and span selection offered advantages for video comprehension. Our findings have implications for designers, computer vision researchers, and sign language educators.
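As a rough sketch of the in-situ lookup flow this abstract describes, the snippet below models a viewer selecting a time span in a video and retrieving dictionary candidates for every sign segment that overlaps that span. All names (SignSegment, segments_in_span) and the sample data are hypothetical illustrations under assumed pre-segmented video, not the authors' actual implementation.

```python
from dataclasses import dataclass

@dataclass
class SignSegment:
    start: float   # seconds into the video
    end: float
    gloss: str     # top dictionary match for this segment (e.g., "TEACHER")

def segments_in_span(segments: list[SignSegment],
                     span_start: float, span_end: float) -> list[SignSegment]:
    """Return every sign segment that overlaps the user-selected time span."""
    return [s for s in segments if s.end > span_start and s.start < span_end]

# Example: the viewer pauses and selects seconds 12.0-15.5 of the video,
# covering more than one unfamiliar sign.
segments = [
    SignSegment(11.2, 12.4, "SCHOOL"),
    SignSegment(12.6, 13.9, "TEACHER"),
    SignSegment(14.1, 15.3, "EXPLAIN"),
]
for seg in segments_in_span(segments, 12.0, 15.5):
    print(f"{seg.start:.1f}-{seg.end:.1f}s: candidate sign '{seg.gloss}'")
```

Supporting a span rather than a single point is what lets a learner look up several adjacent unfamiliar signs in one selection, without leaving the video player.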
-
Live TV news and interviews often include multiple individuals speaking with rapid turn-taking, which makes it difficult for viewers who are Deaf and Hard of Hearing (DHH) to follow who is speaking when reading captions. Prior research has proposed several methods of indicating who is speaking. While recent studies have observed varied preferences among DHH viewers for speaker-identification methods in videos with different numbers of onscreen speakers, no study has systematically explored whether there is a formal relationship between the number of people onscreen and DHH viewers' preferences for how captions indicate the speaker. We conducted an empirical study, followed by a semi-structured interview, with 17 DHH participants to record their preferences among various speaker-identifier types for videos that vary in the number of onscreen speakers. We observed an interaction effect between DHH viewers' preference for speaker identification and the number of speakers in a video. An analysis of open-ended feedback from participants revealed several factors that influenced their preferences. Our findings guide broadcasters and captioners in selecting speaker-identification methods for captioned videos.
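To make the design space concrete, the small sketch below renders two common speaker-identification styles from the captioning literature: an explicit name prefix and the conventional ">>" speaker-change cue. These two styles are illustrative examples, not necessarily the exact conditions tested in this study.

```python
def caption_with_name(speaker: str, text: str) -> str:
    """Identify the speaker explicitly by prefixing the caption with their name."""
    return f"{speaker.upper()}: {text}"

def caption_with_chevrons(text: str) -> str:
    """Mark only that the speaker changed, using the conventional '>>' cue."""
    return f">> {text}"

print(caption_with_name("Maria", "Our top story tonight..."))
print(caption_with_chevrons("Thanks, Maria. I'm live at the scene now."))
```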
-
Searching for the meaning of an unfamiliar sign-language word in a dictionary is difficult for learners, but emerging sign-recognition technology will soon enable users to search by submitting a video of themselves performing the word they recall. However, sign-recognition technology is imperfect, and users may need to scan a long list of candidate results to find the one they seek. To speed this search, we present a hybrid-search approach in which users begin with a video-based query and then filter the search results by linguistic properties, e.g., handshape. We interviewed 32 ASL learners about their preferences for the content and appearance of the search-results page and for filtering criteria. A between-subjects experiment with 20 ASL learners revealed that our hybrid search system outperformed a video-based search system on multiple satisfaction and performance metrics. Our findings provide guidance for designers of video-based sign-language dictionary search systems, with implications for other search scenarios.
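A minimal sketch of the hybrid-search idea follows: a recognizer returns a ranked candidate list for the submitted video, and the user then narrows it by a linguistic property such as handshape. The Candidate type, handshape labels, and scores are hypothetical stand-ins, not the paper's system.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Candidate:
    gloss: str
    score: float      # recognizer confidence for the submitted video query
    handshape: str    # linguistic property available as a filter

def hybrid_search(candidates: list[Candidate],
                  handshape: Optional[str] = None) -> list[Candidate]:
    """Rank candidates by recognition score, optionally filtered by handshape."""
    pool = [c for c in candidates if handshape is None or c.handshape == handshape]
    return sorted(pool, key=lambda c: c.score, reverse=True)

results = [
    Candidate("MOTHER", 0.61, "5"),
    Candidate("FATHER", 0.58, "5"),
    Candidate("FINE",   0.44, "5"),
    Candidate("PLEASE", 0.31, "flat-B"),
]
# Video-only search returns the full ranked list; adding a handshape
# filter collapses it to the matching candidates.
print([c.gloss for c in hybrid_search(results)])
print([c.gloss for c in hybrid_search(results, handshape="flat-B")])
```

The design point is that filtering complements an imperfect recognizer: even when the desired word is ranked low by the video query alone, one property filter can bring it near the top.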
-
Deaf and hard of hearing individuals regularly rely on captioning while watching live TV. Live TV captioning is evaluated by regulatory agencies using various caption evaluation metrics. However, these metrics are often not informed by the preferences of DHH users or by how meaningful the captions are. There is a need for caption evaluation metrics that take the relative importance of words in a transcript into account. We conducted a correlation analysis between two types of word embeddings and human-annotated word-importance scores in an existing corpus. We found that normalized contextualized word embeddings generated using BERT correlated better with manually annotated importance scores than word2vec-based word embeddings. We make available a pairing of word embeddings and their human-annotated importance scores. We also provide proof-of-concept utility by training word-importance models, achieving an F1-score of 0.57 on the six-class word-importance classification task.
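The sketch below shows the shape of such a correlation analysis: per-word embeddings are normalized, reduced to a scalar feature, and compared against human-annotated importance scores with Spearman correlation. The random arrays are placeholders standing in for real BERT or word2vec vectors, and the scalar reduction (cosine similarity to the mean vector) is one illustrative choice, not the paper's exact pipeline.

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)

# One 768-dim vector per word (placeholder for contextualized BERT output).
word_vectors = rng.normal(size=(100, 768))
# Human-annotated importance score per word, in [0, 1].
importance = rng.uniform(size=100)

# Normalize each word vector, then reduce it to a scalar feature
# (here, cosine similarity to the corpus mean vector) for correlation.
normed = word_vectors / np.linalg.norm(word_vectors, axis=1, keepdims=True)
mean_vec = normed.mean(axis=0)
feature = normed @ (mean_vec / np.linalg.norm(mean_vec))

rho, p = spearmanr(feature, importance)
print(f"Spearman rho = {rho:.3f} (p = {p:.3f})")
```

With real embeddings, a higher rho for the BERT-based features than for word2vec-based ones would reproduce the qualitative finding reported above; the random placeholders here will naturally yield a correlation near zero.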
